How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media?

نویسندگان

  • Munmun De Choudhury
  • Yu-Ru Lin
  • Hari Sundaram
  • K. Selçuk Candan
  • Lexing Xie
  • Aisling Kelliher
چکیده

Platforms such as Twitter have provided researchers with ample opportunities to analytically study social phenomena. There are however, significant computational challenges due to the enormous rate of production of new information: researchers are therefore, often forced to analyze a judiciously selected “sample” of the data. Like other social media phenomena, information diffusion is a social process–it is affected by user context, and topic, in addition to the graph topology. This paper studies the impact of different attribute and topology based sampling strategies on the discovery of an important social media phenomena–information diffusion. We examine several widely-adopted sampling methods that select nodes based on attribute (random, location, and activity) and topology (forest fire) as well as study the impact of attribute based seed selection on topology based sampling. Then we develop a series of metrics for evaluating the quality of the sample, based on user activity (e.g. volume, number of seeds), topological (e.g. reach, spread) and temporal characteristics (e.g. rate). We additionally correlate the diffusion volume metric with two external variables–search and news trends. Our experiments reveal that for small sample sizes (30%), a sample that incorporates both topology and usercontext (e.g. location, activity) can improve on naı̈ve methods by a significant margin of ∼15-20%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Knowledge Management Approach to Discovering Influential Users in Social Media

A key step for success of marketer is to discover influential users who diffuse information and their followers have interest to this information and increase to diffuse information on social media. They can reduce the cost of advertising, increase sales and maximize diffusion of information.  A key problem is how to precisely identify the most influential users on social networks. In this pape...

متن کامل

Factors Affecting Social Commerce and Exploring the Mediating Role of Perceived Risk (Case Study: Social Media Users in Isfahan)

Owing to the ever-increasing prevalence of social media use, social commerce has become an important part of e-commerce. This study endeavors to explore the impact of social media quality and social support on the social commerce (SC) intention directly and through the variable of perceived risk. The sample included 214 social media users in Isfahan collected through simple random sampling meth...

متن کامل

Identifying the Challenges of Social Development in the Faculty Members of Tehran's Comprehensive Universities: Examining the Phenomenon bullying

The purpose of this study is to investigate the social phenomenon of bullying among faculty members. In this way, various aspects of these issues were tried and presented with respect to the context of the institution of university in Iran. The present study was conducted within the framework of the qualitative approach and using the fundamental theory research method (with a Strauss and Corbin...

متن کامل

A Study on the Use of Social Media to Understand Consumer Preference: The Case of Starbucks

The paper seeks to identify Starbuck's experience in using social media, understand how social media is linked to customer knowledge management, and assess how social media services could have contributed to Starbucks success. Starbucks demonstrates versatility to engage customers and support different part of customer knowledge management strategy through various social media platforms, such a...

متن کامل

The Effect of Social and Cultural Factors on Generation Gap

This study focuses on the effect of social and cultural determinants on generation gap in Tehranian families in 2011. The purpose of this study is to determine effective factors on generation gap in Tehranian families, analytical and empirical patterns and be surveyed by related theories and effective factors. The research was prepared by questions including whether there is a relationship betw...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010